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ABSTRACT 

Splicing of precursor messenger RNA is a hallmark 
of eukaryotic cells, which is carried out by the 
spliceosome, a multi-megadalton ribonucleoprotein 
machinery. The splicing reaction removes non- 
coding regions (introns) and ligates coding regions 
(exons). The spliceosome is a highly dynamic 
ribonucleoprotein complex that undergoes 
dramatic structural changes during its assembly, 
the catalysis and its disassembly. The transitions 
between the different steps during the splicing 
cycle are promoted by eight conserved DExD/H 
box ATPases. The DEAH-box protein Prp43 is re- 
sponsible for the disassembly of the intron-lariat 
spliceosome and its helicase activity is activated 
by the G-patch protein Ntr1. Here, we investigate 
the activation of Prp43 by Ntr1 in the presence and 
absence of RNA substrate by functional assays and 
structural proteomics. Residues 51-110 of Ntrl were 
identified to be the minimal fragment that induces 
full activation. We found protein-protein cross-links 
that indicate that Prp43 interacts with the G-patch 
motif of Ntrl through its C-terminal domains. 
Additionally, we report on functionally important 
RNA binding residues in both proteins and 
propose a model for the activation of the helicase. 

INTRODUCTION 

During eukaryotic gene expression, defined parts of the 
chromosomal DNA are transcribed to precursor messen- 
ger RNAs (pre-mRNAs), which are processed before 
being exported to the nucleus. The processing includes 



the addition of a 5'-cap as well as a 3'-polyA tail (1,2) 
and splicing. Splicing of pre-mRNA consists of the 
removal of non-coding sequences (introns) and the 
ligation of expressed sequences (exons), which result in a 
mature mRNA (3). Splicing of pre-mRNAs requires two 
consecutive transesterification reactions and is catalyzed 
by the spliceosome, a multi-megadalton and highly 
dynamic ribonucleoprotein (RNP) complex (3,4). The 
spliceosome is formed by five uridine-rich small nuclear 
RNAs (U snRNAs) and numerous proteins; a subset of 
these is stably associated with the U snRNAs, making up 
the respective small nuclear RNPs (U snRNPs). The Ul, 
U2, U4/U6 and U5 snRNPs are part of the major (or U2- 
dependent) spliceosome. In some genes of metazoans and 
plants, an additional intron class termed U12-type introns 
has been found; these are spliced by the minor 
spliceosome, which comprises the Ull, U12, U4atac/ 
U6atac and U5 snRNPs (5,6). 

The spliceosome is a single-turnover enzymatic 
machine, which assembles on its substrate, catalyzes the 
splicing reaction and dissociates afterwards, with multiple 
changes in its protein composition during the different 
steps. Thus, at different stages of the splicing cycle, the 
spliceosome varies greatly in protein composition, with 
the number of implicated proteins averaging between 
110 and 180 in metazoans (7). 

The dynamics of the spliceosome are controlled by eight 
conserved DExD/H-box ATPases, which promote the 
events leading to the assembly of the spliceosome, the ca- 
talysis of the splicing reaction and the dissociation of the 
intron-lariat spliceosome (8,9). The term "DExD/H" is 
based on a conserved amino acid sequence in the 
helicase motif 2. The DEAD-box protein Sub2 (UAP56 
in humans) recruits the U2 snRNP to the spliceosome 
(10), which is then partially remodelled by the DEAD- 
box protein Prp5 (11). Prp28, an ATPase that also 
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contains the DEAD-motif, is responsible for the release of 
the Ul snRNP (12). The Ski2-hke ATPase Brr2 plays a 
role in the release of the U4 snRNP during spliceosomal 
assembly (13). 

During the catalysis of the splicing reaction, the DEAH- 
box ATPases Prp2 and Prpl6 are required for the release 
of the sphcing proteins SF3a/SF3b and Yju2/Cwc25, re- 
spectively (14-16). Prp22, which is also a DEAH-box 
protein, is needed during the second transesterification 
step and also catalyzes the release of the mature mRNA 
(17,18). Finally, the DEAH-box ATPase Prp43 is involved 
in the disassembly of the post-spliceosome, leading to the 
release of the intron lariat, the U6 snRNA, a 20-25 S U2 
snRNP and an 18 S U5 snRNP (19,20). It has also been 
shown that Prp43, in addition to its function during 
sphcing, is involved in the biogenesis of the small and 
the large ribosomal subunit (21,22) 

Additionally, the spliceosomal DExD/H-box ATPases 
Prp5, Prpl6, Prp22, Prp28 and Prp43 have also been sug- 
gested to be involved in ensuring the fidelity of splicing 
and acting as proofreaders (23-25). 

Like all members of the superfamily two helicases, the 
spliceosomal DExD/H helicases contain a central helicase 
core that is formed by two RecA-Hke domains (26,27) and 
is responsible for nucleotide binding and the interaction 
with the RNA (28). In addition, there are additional 
domains present in these enzymes that have been shown 
to regulate their activity or to act as platforms for inter- 
action partners, as their action during the splicing cycle 
has to be precisely regulated (9,29). 

Two of the spliceosomal DEAH-box ATPases, Prp2 
and Prp43, have been shown to interact during the 
splicing cycle with proteins that contain a G-patch 
motif, namely, Prp2 with Spp2 and Prp43, with Ntrl 
(30,31). The G-patch motif encompasses ~50 residues, 
among which six are highly conserved glycines; it is pre- 
dominantly found in proteins associated with nucleic acid 
interactions (32). 

Although no effect of Spp2 on the enzymatic activities 
of Prp2 has been reported so far, the G-patch motif con- 
taining part of Ntrl strongly stimulates the helicase 
activity of Prp43, and hence is regarded as an activator 
of Prp43 (33). Recently, the crystal structure of Prp43 
from Saccharomyces cerevisiae (yPrp43) in complex with 
adenosine diphosphate (ADP) revealed that it shares some 
features with the processive DNA-helicase Hel308 (34- 
36). In yPrp43, the two RecA-like domains are followed 
by a winged hehx, a ratchet and an OB-fold domain; an N- 
terminal extension connects the first RecA-like domain 
with the C-terminal (NTE) OB-fold domain. The three 
C-terminal domains of yPrp43 are also present in human 
Prp22 and seem to be a conserved feature of all four 
spliceosomal DEAH-box proteins (37). Although this 
crystal structure is the first reported for a DEAH-box 
protein and provides insights into the domain organiza- 
tion of this family, it explains neither the binding site of 
the G-patch motif Ntrl nor the structural changes that 
transform yPrp43 into its active form. Furthermore, no 
3D structure of a G-patch motif is known to date. 

Understanding the detailed dynamics that underhe 
helicase activation requires extensive structural and 



biochemical studies. Here, we show that the G-patch 
motif of Ntrl is intrinsically unstructured but can adopt 
secondary-structure elements on binding to Prp43. In 
addition, we identified the minimal fragment needed for 
activation. We then set out to investigate structural 
changes associated with the binding of Ntrl to Prp43 by 
applying a series of structural probing techniques in com- 
bination with mass spectrometry. These are protein- 
protein (38), protein-RNA cross-linking (39) and 
hydroxyl-radical footprinting (40), all of which have 
recently proven to be useful tools in the investigation of 
multiprotein and RNP complexes. We were able to define 
the sites of the interaction between Prp43 and Ntrl at a 
molecular level and, furthermore, results of hydroxyl- 
radical probing led us to suggest changes in the conform- 
ation of Prp43 on Ntrl binding. In addition, our protein- 
RNA cross-hnking data demonstrate that (i) Ntrl is in 
direct contact with the RNA through its G-patch motif 
and (ii) on binding to Ntrl in the presence of RNA, the 
ratchet domain of Prp43 becomes more exposed and avail- 
able for cross-hnking as compared with Prp43 bound to 
RNA alone. 



MATERIALS AND METHODS 

Molecular cloning 

Prp43 was amplified from genomic yeast DNA (S. 
cerevisiae S288c) using the oligonucleotides Prp43-for 
and Prp43-rev (Supplementary Table SI for the sequences 
of all oligonucleotides) and inserted into a pENTRY- 
IBAIO vector following the IBA StarGate manual 
(version December 2008) from IBA GmbH. 
Subsequently, the open reading frame (ORE) of Prp43 
was transferred into a pASG-IBA3 plasmid, adding a C- 
terminal strep affinity sequence to the protein. 

Ntrl(l-120) was amplified from genomic yeast DNA 
using the ohgo nucleotides Ntrl -for and Ntrl(120)-rev 
and inserted into a pENTRY-IBAlO vector following 
the IBA StarGate manual. Subsequently, the ORE of 
Ntrl(l-120) was transferred into a pPSG-IBA5 plasmid, 
adding an N-terminal strep affinity sequence to the 
protein. 

Ntrl (5 1-1 10) was amphfied from genomic yeast DNA 
using the oligonucleotides Ntrl(51)-for and Ntrl(110)-rev 
and inserted into a pENTRY-IBAlO vector following the 
IBA StarGate manual. Subsequently, the ORF of 
Ntrl(51-110) was transferred into a pASG-IBA25 
plasmid, adding an N-terminal glutathione S-transferase 
(GST) affinity sequence to the protein. 

For the cloning of expression plasmids for the fusion 
proteins, the sequences of Prp43 and Ntrl -fragments were 
amplified from genomic yeast DNA. (Parts of) the sequences 
of Prp43 and Ntrl were amplified for the following vectors: 
pETM-13-Prp43-Ntrl(51-110) (oligonucleotides Prp43- 
IF-for, Prp43-IF-51-rev, Ntrl-51-FL-for and Ntrl-110- 
rev), pETM-13-Prp43(l-745)-Ntrl(51-110) (ohgonucleo- 
tides Prp43-IF-for, Prp43-745-IF-51-rev, Ntrl-51-745-for 
and Ntrl-llO-rev), pETM13-Prp43-Ntrl(51-102) (oligo- 
nucleotides Prp43-IF-for, Prp43-IF-51-rev, Ntrl-51-FL-for 
and Ntrl-102-rev), pETM13-Prp43-Ntrl(51-99) 
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(oligonucleotides Prp43-IF-for, Prp43-IF-51-rev, Ntrl-51- 
FL-for and Ntrl-99-rev), pETM13-Prp43-Ntrl(59-110) 
(oligonucleotides Prp43-IF-for, Prp43-IF-59-rev, Ntrl-59- 
FL-for and Ntrl-llO-rev) and pETM13-Prp43-Ntrl(62- 
110) (oligonucleotides Prp43-IF-for, Prp43-IF-62-rev, Ntrl- 
62-FL-for and Ntrl-llO-rev). The respective fragments were 
transferred into a Ncol/Sall (Fermentas)-digested pETM-13 
expression vector using the In-Fusion technique (Clontech) 
and following the user manual (version October 201 1), and a 
C-temiinal Strep affinity sequence was included in the 
sequence of the reverse oligonucleotides. 

Site-directed mutagenesis of the plasmid pETM- 
13_Prp43-Ntrl(51-110) was performed by plasmid amph- 
fication via polymerase chain reaction using the primer 
pairs Prp43_K418E-for/Prp43_K418E-rev, Prp43_Y59 
5 A-for/Prp43_Y595 A-rev, Prp43_Y6 1 0 A_S6 1 4A-for/Prp 
43_Y6 1 0A_S6 1 4A-rev, Prp43_Y630A-for/Prp43_Y630 A- 
rev and Ntrl_K67E-for/Ntrl_K67E-rev, respectively. 

Protein expression and purification 

The plasmid pASG-lBA3-Prp43 was transformed into 
Rosetta2 (DE3) cells (Novagen). Expression cultures of 
ten 2-L flasks containing 500 ml 2YT media were grown 
at 37°C at 220 rpm until an OD600 of 0.6 was reached. 
The expression of Prp43-Strep was induced by adding 
200 (ig anhydrotetracychn per htre of expression culture, 
and the ceU cultures were incubated at 16°C and at 
220 rpm overnight. 

Cells were harvested by centrifugation at 4000 x g for 
15 min at 4°C. The cells were resuspended in buffer A 
(400 mM NaCl, 50niM Tris/HCl pH 7.5, 2mM MgCy 
and were disrupted using a microfluidizer (Microfluidics). 
The soluble fraction was separated from the unsoluble 
fraction by centrifugation at 30000 x g for 45 min at 4°C. 

Prp43-Strep was bound to a StrepTactin HP Sepharose 
column (GE Healthcare) and eluted by buffer A supple- 
mented with 2mM D-desthiobiotin. Prp43-Strep was 
purified to homogeneity using a Superdex 200 (26/60) 
column (GE Healthcare) in buffer S (100 mM NaCl, 
10 mM Tris pH 7.5, 2mM MgCh)- 

The Prp43-Ntrl fusion proteins were expressed, har- 
vested and purified in the same way as described for 
Prp43, with the only difference that the protein expression 
was induced by adding isopropyl P-D-l-thiogalacto- 
pyranoside (IPTG) to a final concentration of 1 mM to 
the expression culture instead of anhydrotetracychn. 

Strep-Ntrl(l-120) and GST-Ntrl(51-1 10) were ex- 
pressed and harvested as described for Prp43, with the 
differences that the expression of Ntr 1(1-1 20) was 
induced by adding IPTG to a final concentration of 
1 mM to the expression culture and that the expression 
cultures were incubated after induction at 30°C at 
220 rpm for 5h for both Ntrl truncations. 

Strep-Ntrl(l-120) was isolated from the supernatant 
using a StrepTactin HP Sepharose column in buffer A. 
The elution fractions were desalted into buffer B (50 mM 
NaCl, 10 mM Tris/HCl, pH 7.5, 2mM MgCy using a 
HiPrep Desalting 26/10 column (GE Healthcare) and 
bound to a Source 30 Q column (GE Healthcare), from 
which the pure protein was eluted by a gradient of buffer 



C (600mM NaCl, lOOmM Tris/HCl pH 7.5, 2mM 

Mgcy. 

GST-Ntrl(51-110) was isolated from the soluble 
fraction using a GSH Sepharose 15-ml column (GE 
Healthcare), from which it was eluted by buffer A supple- 
mented with 30 mM reduced glutathione. The GST 
affinity sequence was removed by incubation with 
0.01 mg PreScission Protease (GE Healthcare) per Img 
protein. Ntrl(51-110) was separated from the GST by 
performing a size exclusion chromatography using a 
Superdex 75 (26/60) column (GE Healthcare) in buffer 
S. For all protein samples, the A260/A280 ratio was 
determined after the last purification step to analyse any 
potential nucleic acid contamination. During establishing 
of the purification protocols, it was observed that an add- 
itional final Source 30 Q column purification step neither 
changed the A260/A280 nor the ATPase activities, 
showing that no nucleic acid was present in the samples. 

Complex formation 

The complexes were formed by incubation of Prp43 with a 
5-fold molecular excess with the Ntrl -fragment and 
further purified by a S200 size exclusion chromatography 
in buffer S. For the protein-protein cross-hnking and the 
hydroxyl radical experiments, the complexes were 
transferred into buffer M (lOOmM NaCl, lOmM 
sodium phosphate buffer, pH 7.4, 2mM MgCU) using a 
HiPrep Desalting 26/10 column (GE Healthcare). 

Determination of tlie ATPase activity 

The ATPase assay was performed as described previously 
(37), with the difference that 5 |iM of A20 RNA was used 
and that the measurements were performed in buffer S. 

Determination of the helicase activity 

The unwinding assay was performed by fluorescence spec- 
troscopy as described previously (41). For the strand an- 
neahng, 4.5 |iM of the two RNA strands (5'-GCGCCUAC 
GGAGCUGGUGGCGUAGGCGCAAAAAAAAAAA 
AAAAAAAAA-3' and 5'-(Cy5)-GCGCCUACGCCACC 
AGCUCCGUAGGCGC-(BBQ)-3', IBA GmbH) was 
supplemented with 1 \x\ lOOmM Tris, pH 8.0, incubated 
at 80°C for 5 min and cooled down to 20°C with a step 
size of 3°C/min. Subsequently, the sample was diluted to a 
1 |.iM double-stranded RNA (dsRNA) solution. 

The 150^1 reaction mixtures contained 20 nM protein, 
50 nM dsRNA, 10 mM Tris, pH 7.5, 100 mM sodium 
chloride and 2mM MgCl2. The samples were incubated 
at 20° C in the fluorescence spectrometer Fluoromax III 
(Horiba Jobin Yvon) for 15 min, and the measurements 
were started by adding ATP to a final concentration of 
1 mM. The excitation wavelength was set to 643 nm, and 
the fluorescence at 667 nm was measured every 2 s during 
10 min. For the determination of the maximal reaction 
velocity (Vmax)> the initial slope Vo (AF/min) of the fluor- 
escence was determined three times independently for each 
sample. The amount of unwound dsRNA was calculated 
with respect to the signal at 667 nm for the annealed and 
completely unwound dsRNA, respectively. 
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Determination of dissociation constants 

For the determination of dissociation constants, 1 [iM 
Prp43 was incubated with the Ntrl-fragment in concen- 
trations between 0.1 and 12 ^M. Each sample was 
incubated for 2h, and subsequently the ATP hydrolysis 
rate at 100 mM ATP at 20°C was determined. For each 
concentration, three independent measurements were per- 
formed. The ATPase activity was plotted against the con- 
centration of the Ntrl-fragment and fitted to a hyperbola 
using SigmaPlot 11.0 (Systat Software). The was 
directly deduced from the equation of the hyperbola fit. 

Far-UV circular dichroism spectroscopy 

The protein sample was desalted in buffer CD (lOOmM 
sodium phosphate, pH 7.5) and diluted to O.lnig/ml. 
Three hundred microhtres of the sample was pipetted 
into a cuvet with a path length of 1 mm and transferred 
into a Chirascan CD spectrometer (AppliedPhotophysics). 
The spectra were recorded in far-ultraviolet (UV) range 
from 195-260 nm at 20°C with a time-per-point value of 
1 s and a path length of 1 nm. Every measurement was 
performed for five times independently. For the final ab- 
sorption curve, the values of the negative control, which 
contained exclusively the sample buffer, were subtracted 
from the values of the protein sample. Trifluorethanol 
(TEE) was added in concentrations between 10 and 40% 
(v/v) to the samples. 

Secondary structure prediction 

The lUPRED server was used to predict disordered 
regions in proteins by the calculation of unfavourable 
estimated energies (42). The ANCHOR server combines 
the disorder tendency predicted by lUPRED with the sen- 
sitivity to structural environment, and thus identifies 
protein regions that might form defined secondary struc- 
ture elements when binding to a globular protein (43). 

Hydroxyl radical probing and digestion of peptides 

Proteins were probed by chemically generated hydroxyl 
radicals, as described in (44). Seventy microhtres of a 
50-|iM protein complex solution was added to a reaction 
tube containing 10|il of 13mM Fe-ethylenediaminete- 
traacetic acid (Sigma-Aldrich; Merck, respectively) 
solution and 10^1 of 0.2 M sodium ascorbate (Sigma- 
Aldrich) solution, both in 50 mM Na2HP04 buffer, pH 
6.5. For control purposes, a 17|il ahquot was transferred 
to 40^x1 quenching buffer (2M Tris, pH 7.2, Prolabo). 
Finally, 10|.d of 3% v/v H2O2 (Sigma-Aldrich) was 
added to the Fenton mixture to start the reaction. At 10, 
60, 150 and 300 s, 17|il aliquots were removed and 
transferred to new microcentrifuge tubes containing 40 (il 
quenching buffer. 

Protein pellets were dissolved and denatured in a 50 |il 
acetonitrile:buffer (lOOmM ammonium bicarbonate) 
mixture (80:20), by intense vortexing, followed by a 
10-min incubation in a thermomixer at 25° C and 
500 rpm. Fifty microhtres of a 2-mM dithiothreitol 
(DTT) solution in 80% acetonitrile (ACN) was added. 
The sample was incubated for 30 min at 25° C and 



500 rpm. To alkylate cysteine residues, 50 (il of a 12-mM 
iodoacetamide solution in 80% ACN was added and 
incubated for further 30 min at 25°C and 500 rpm. 
Samples were then predigested with Lys-C (Roche 
Diagnostics GmbH), added in a 1/20 w/w enzyme to 
protein ratio and incubated for 3h at 25°C and 500 rpm. 
Trypsin (Promega) was added in a 1/20 w/w enzyme to 
protein ratio, and the sample incubated overnight under 
the same conditions. Samples were dried in a vacuum con- 
centrator and stored at — 20°C until posterior use (45). 

UV-induced cross-linking of y-|32P|-ATP-labeiIed RNA 
and Prp43/Ntrl complex 

The single-stranded RNA oligonucleotide (5'-GACGGG 
CUGUUGGUGAUGGAC-3', Roche Kulmbach GmbH) 
was radiolabelled with y-[32P]-ATP (6000Ci/mmol) and 
1 \xL T4 polynucleotide kinase following standard proced- 
ures. The resulting labelled RNA was incubated for 30 min 
on ice with ~50-fold molar excess of Prp43/Ntrl(l-120). 
The resulting protein-RNA complex divided into 6.8 |il 
ahquots and irradiated at 254 nm for 0, 1, 2, 5, 10 and 
12 min, using an in-house constructed cross-linking device, 
including 4 8W lamps (F8T5BL Sankyo Denki). The 
cross-hnked samples were analysed by sodium dodecyl 
sulphate-polyacrylamide gel electrophoresis (SDS- 
PAGE), and the resulting gel was coomassie-stained and 
bands containing radiolabelled RNA were visuahzed with 
a STORM phosphoimager (GE Healthcare). 

UV-induced protein-RNA cross-linking and enrichment 
of cross-linked peptides 

UV cross-linking and enrichment were performed as 
described in (46). Briefly, 1 nmol of the single-stranded 
RNA ohgonucleotide 5'-GACGGGCUGUUGGUGAU 
GGAC-3' was incubated in a 1:1 molar ratio with either 
Prp43/Ntrl or Prp43 on ice for 30 min. The resulting 
mixture was taken to 100^1 in lOmM Tris, pH 7.5, 
100 mM NaCl, 2mM MgCb buffer, transferred to black 
polypropylene niicroplates (Greiner Bio-One) and 
irradiated at 254 nm for 10 min. Samples were ethanol 
precipitated, denatured in 4M urea, 50 mM Tris HCl, 
pH 7.9, and digested for 2h at 52°C with 1 \ig RNase A 
and 1 U RNase Tl (Ambion, Applied Biosystems). RNA 
digestion was foUowed by overnight trypsin proteolysis at 
37°C. Samples were desalted on a C18 (Dr Maisch 
GmbH) column. Enrichment of cross-hnked peptides 
was performed on a Ti02 (GL Sciences) column. Both 
columns were prepared in-house, and the enrichment 
was achieved essentially as in the procedure described in 
(46); after evacuation to dryness, the sample was dissolved 
in 200mg/ml 2,5-dihydroxy benzoic acid (Sigma-Aldrich) 
in 80% ACN containing 5% trifluoroacetic acid; it was 
then loaded on the Ti02 column, washed with 200 mg/mL 
dihydroxy benzoic acid in 80% ACN, 5% trifluoroacetic 
acid and eluted with 0.3 M ammonia. 

Protein-protein cross-linking 

The optimal cross-hnker to protein ratio was determined 
by using 2.5 |ig aliquots of the Prp43/Ntrl complex and a 
series of cross-hnker molar excesses of 5, 10, 25, 50, 100 
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and 200 as well as a control. Samples were allowed to react 
with freshly prepared BS3 for 30 min at room tempera- 
ture. The chosen cross-linker to protein ratio was 30, 
which led to a high yield, but to a homogeneous sharp 
band as visualized on SDS-PAGE. The cross-linked 
samples were analysed by SDS page on a 4-12% Bis- 
Tris gel (Invitrogen) with MES as running buffer and 
stained with coomassie blue. 

For MS analysis, both Prp43/Ntrl(l-120) and Prp43/ 
Ntrl(5 1-110) were cross-Unked with freshly prepared BS3 
and BS2G (Pierce, Thermo Scientific) in a 20:1 and 30:1 
cross-linker to protein ratio, allowed to react and separated 
as described earlier. Cross-linked bands were excised from 
the gel and combined in a single reaction cup according to 
cross-linking reagent. In-gel digestion and extraction of 
peptides was achieved as described elsewhere (47). The 
solution of extracted peptides was concentrated on a 
vacuum evaporator to 5 |.il and diluted to a final volume 
of 10|.il to reach a sample solvent composition of 5% v/v 
ACN and 1 % v/v FA. Samples were submitted for imme- 
diate analysis to the mass spectrometer. 

Nano-liquid chromatography and mass 
spectrometric methods 

For liquid chromatography-tandem mass spectrometry 
(LC-MS/MS) analyses, hydroxyl radical footprinting and 
UV-induced cross-hnking samples were dissolved in 30 
and 10|il sample solvent (5% v/v ACN, 1% v/v FA), 
respectively. Five microlitres was injected onto a nano- 
hquid chromatography system (Agilent 1100 series, 
Agilent Technologies) including an ~2-cm-long, 150-|im 
inner diameter CI 8 trapping column in-line with an 
~15-cm-long, 75-|rm inner diameter CI 8 analytical 
column (both packed in-house, CI 8 AQ 120A5|im, 
Dr Maisch GmbH). Peptides were loaded on the 
trapping column at a flow rate of 10|iL/min in buffer A 
(0.1% FA in H20, v/v) and subsequently eluted and 
separated on the analytical column with a gradient of 
7-38% buffer B (95% acetonitrile, 0.1% FA in H2O, 
v/v) with an elution time of 33 min (0.87%/min) and a 
flow rate of 300 nl/min. 

Onhne electrospray mass spectrometry was performed 
with an LTQ-Orbitrap Velos instrument (Thermo 
Scientific), operated in data-dependant mode using a 
TOPIO method. MS scans were recorded in the range of 
350-1 600 m/z. The 10 most intense ions were selected for 
subsequent MS/MS. Both precursor ions and well as 
fragment ions were scanned in the Orbitrap. Fragment 
ions were generated by HCD activation (higher energy 
colhsion dissociation, normalized coflision energy = 40), 
and recorded from m/z = 100. As precursor ions as well 
as fragment ions were scanned in the Orbitrap, and the 
resulting spectra were measured with high accuracy 
(<5ppm), both in the MS and MS-MS levels. 

Both protein-protein and protein-RNA cross-hnking 
samples were submitted to LC-MS analysis immediately 
after their preparation. Nano-LC chromatography was 
performed as described earlier, except for the gradient, 
which extended over 97 min. Mass spectrometric 
analysis of protein-protein cross-finked peptides was 



performed in the same instrument, with a TOP8 method 
in data-dependent acquisition mode. MSI ions were 
recorded in the range of 300-1 800 m/z at 60000 reso- 
lution. Fragmentation was generated by colhsion 
induced activation and only precursor ions of charge 
state >3 were selected for fragmentation. Fragment ions 
were acquired in the Orbitrap at 7500 resolution. Dynamic 
exclusion was set at 30 s with a repeat count of 1 . 

Data analysis 

Hydroxy I radical footprinting 

Data analysis was performed using MaxQuant 1.1.1.14 
(48). Database search was performed against Prp43 and 
Ntrl (1-120) sequences, with Andromeda considering 14 
variable modifications: hydroxylation (+16 Da) in Hys, 
Val, Met, Leu, Tyr, Arg, Pro, He, Phe; dioxidation (+32 
Da) of Met, Tyr, Phe; deguanidination (—43 Da) in Arg 
and ring-opening (—22) in His (49). By making use of ex- 
tracted ion chromatograms, obtained with the label-free 
algorithm from MaxQuant, the ratios of oxidized to non- 
oxidized peptides in which a specific residue is involved 
were determined, and thus the dose-response plots for 
modified peptides were obtained (50). 

UV-induced protein-RNA cross-linking 

Protein-RNA cross-hnking experiments were analysed 
with OpenMS (51,52) and OMSSA (53) as search 
engine. Data analysis workflows were assembled especially 
for our laboratory by Timo Sachsenberg (Prof. Oliver 
Kohlbacher, Apphed Bioinformatics Group, Eberhard 
Karls University, Tubingen), based on (46) and a manu- 
script in preparation. MS. raw files were converted into 
the.mzML format with msconvert (54). High-scoring can- 
didate spectra were manually annotated to completion. 

Protein-protein cross-linking 

Data analysis was performed with MassMatrix as 
described in their publications and manuals (55,56). 
Thermo Scientific raw files were converted to the 
mzxmL data format with MMConverter and submitted 
to database search with the following parameters: 
peptide length between 6 and 40 amino acid long, 
lOppm MSI tolerance and 0.02 Da MS2 tolerance. 
Oxidation in methionine was set as a variable modifica- 
tion, whereas carbamidomethylation of cysteine was set as 
a fixed modification. Spectra were searched against a 
FASTA file composed of the sequences of Prp43 and 
Ntrl (1-120) and a reversed decoy database. The resulting 
matches were validated manually. 

RESULTS 

The G-patch motif of Ntrl is intrinsically unstructured 

Far-UV circular dichroism spectroscopy (CD) was used to 
analyse the content of secondary structure elements of 
truncated Ntrl(51-1 10). The CD-spectrum of this Ntrl- 
fragment indicated the absence of any secondary structure 
elements (Figure lA). The capacity of evolving secondary 
structure elements was investigated by determination of 
far-UV CD spectra in presence of the small molecule 
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Figure 1. Secondary structure analysis of Ntrl. (A) CD spectra of Ntrl(51-110) at different concentrations (0-40% v/v) of TFE. The circular 
dichroism (y-axis) is shown with respect to the wavelength (x-axis). (B) Prediction of the presence of disordered binding regions (dashed line) and 
unstructured regions (continuous line) for full-length Ntrl. The probability to be unstructured or a disordered binding region, respectively (y-axis), is 
shown with respect to the residue number (x-axis). 



TFE at different concentrations. TFE is known to support 
the formation of alpha-helices (57). For concentrations of 
>20% (v/v) of TFE, the CD spectrum of Ntrl(51-110) 
changes significantly and gives rise to two local minima 
at 208 and 222 nm, indicating the presence of alpha-helices 
(Figure lA). 

Hence, the CD spectra suggest that Ntrl(51-110) is 
natively unstructured, and that it has the capacity to 
form alpha-heUces. The signal for beta-strands in a CD 
spectrum is detected by a decrease in signal between 200 
and 210 nm and is equal to the signal for an alpha-helix in 
this range, and the presence of a beta-strand in yNtrl(51- 
110) cannot be proved in the measured CD spectra. 

To learn whether the described potential secondary 
structure elements could be evolved on binding of 
Ntrl (5 1-1 10) to Prp43, an additional bioinformatical 
approach using ANCHOR and lUPRED (42,43) was 
chosen. Although the predicted disorder probability is 
high for the 150N-terminal residues, it is on a low level 
for the residues 151-600 and increases again for the last 
100 C-terminal residues (Figure IB). Three regions are 
predicted to be disordered binding regions: residues 
11-20, 60-77 and 102-107; of these, the second and 
third are part of the minimal required activating 
fragment Ntrl(51-1 10). Residues 60-77 overlap with a 
region predicted to form alpha-helices — interestingly, 
this region is predicted to be unstructured at a probability 
of 0.3, which is lower compared with the residues 1-150, 
but still quite high compared with the region between 1 50 
and 600. 

Combining the experimental data collected by far-UV 
CD spectroscopy with the bioinformatical predictions, the 
following structural model for the G-patch motif of Ntrl 
can be outhned: the residues forming the G-patch of Ntrl 
are intrinsically unstructured, but they have characteristics 
indicating a tendency to form secondary structure 
elements, namely, alpha-heUces and beta-strands. The for- 
mation of these secondary structure elements could be 
induced on binding to the globular interaction partner, 



the spliceosomal helicase Prp43. Regarding full-length 
Ntrl, the G-patch containing N-terminal 150 residues 
and the C-terminal 100 residues are predicted to be un- 
structured, whereas the central part of the protein 
(residues 150-600) seems to form a structured region. 

Ntrl(51-110) is the minimal functional fragment required 
for the activation of Prp43 

Prp43 and Ntrl(l-120), an Ntrl-fragment that has been 
reported as sufficient for the stimulation of the hehcase 
activity of Prp43 (31), were expressed and purified separ- 
ately. The complex of Prp43 and Ntrl(l-120) was formed 
by mixing both proteins and further purified via a size 
exclusion chromatography (Supplementary Figure S2A 
and B). By expressing, purifying and characterizing differ- 
ent N-terminal fragments of Ntrl, it turned out that the 60 
amino acids containing peptide Ntrl(51-110) were still 
able to bind to Prp43 (Supplementary Figure S2C and 
D) and were sufficient to stimulate the ATPase and 
helicase activity of Prp43. Furthermore, it could be 
shown that both Ntrl truncations were able to stimulate 
the ATPase activity of Prp43, even in absence of RNA 
(Figure 2 A, B and E and Tables 1 and 2). Although the 
maximal ATP hydrolysis rate of Prp43 was 
2.43 ± 0.12nM/min in absence and 36.47 ± 0.51 ^M/min 
in presence of RNA, it was stimulated to a similar extent 
in complex with Ntrl(l-120) (11.18 ±0.34nM/min and 
166.48 ± 2.38 nM/min, respectively) and Ntrl(51-110) 
(9.54 ± 0.37 nM/min and 153.40 ± 3.08 ^M/min, 
respectively). 

As the expression and purification of soluble shorter 
fragments of Ntrl was not feasible, they were fused 
directly to the C-terminus of Prp43. Interestingly, the 
fusion of Prp43 and Ntrl(51-110) [referred to as Prp43- 
Ntrl(51-1 10)] behaves hke the functional complex of both 
proteins, as it exhibits a maximal ATP hydrolysis rate, 
which is about twice as high compared with the complex 
of both proteins (18.79 ± 0.66 |iM/min). In the size 
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Figure 2. ATPase and helicase activity of different Prp43/Ntrl complexes and fusion proteins. (A and B) ATPase activity of Prp43, 
Prp43/Ntrl(51-110) and Prp43/Ntrl(l-120) in absence (A) and presence of A20-RNA (B). The ATP hydrolysis rate (y-axis) is shown 
with respect to the ATP concentration (x-axis). (C and D) ATPase activity for different Prp43-Ntrl fusion proteins in absence (A) and presence 
(B) of A20-RNA. (E) Helicase activity of Prp43, Prp43/Ntrl complexes and the Prp43-Ntrl(51-110) fusion protein. The maximal unwinding velocity 
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Table 1. ATPase activities of Prp43 and the complexes Prp43/Ntrl(l-120) and Ntrl{51-110) 



Protein 




In absence of RNA 


In presence 


of A20-RNA 


Km (hM) 


Vmi,x (|iM/min) 


Km (liM) 


Vmax (liM/min) 


Prp43 


11.07 ± 2.07 


2.43 ± 0.12 


9.23 ± 0.53 


36.47 ± 0.51 


Prp43/Ntrl(l-120) 


16.37 ± 1.81 


11.18 ± 0.34 


27.95 ± 1.25 


166.48 ± 2.38 


Prp43/Ntrl(51-110) 


13.42 ± 1.93 


9.54 ± 0.37 


22.48 ± 1.50 


153.40 ± 3.08 
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exclusion chromatography, the fusion protein Prp43- 
Ntrl(51-110) eluted as a monomer, indicating an 
intramolecularinteraction of the G-patch motif and 
the helicase molecule of the the fusion protein. 
(Supplementary Figure S3). Comparison of the ATPase 
activity revealed similar ATPase activities of the fusion 
proteins Prp43-Ntrl(l-120), Prp43-Ntrl(51-120) and 
Prp43-Ntrl(51-110). Furthermore, the ATPase activity 
of the fusion protein Prp43-Ntrl(51-110) turned out to 
be about twice as high as of the prepared complex 
Prp43/Ntr 1(5 1-110), in the absence as well as in the 
presence of A20-RNA. By further decreasing the size of 
the Ntrl -fragment fused to Prp43, the ATPase activity 
decreased dramatically in both cases (Figure 2C and D, 
Table 3). To exclude that this effect was due to a too short 
linker, the ATPase activity of the fusion protein Prp43(l- 
745)-Ntrl(51-l 10) was determined, which resulted in an 
activity significantly higher than for the fusion proteins 
with smaller Ntrl fragments. The fusion protein 
Prp43(l-745)-Ntrl(51-110), which lacks the 22 flexible 
C-terminal residues of Prp43, showed comparable 
activities, even when a shorter linker was used. 

Using the stimulatory effect on the ATPase activity of 
Prp43 on Ntrl binding, an approximate determination of 
the binding constants of Ntrl(l-120) and Ntrl(51-110) to 
Prp43 under steady-state turnover conditions was 
possible. Therefore, the ATPase activity of Prp43 was 
measured in the presence of different concentrations of 
the truncated Ntrl variant (Supplementary Figure S4A 
and B). The dissociation constant of the Prp43/Ntrl(l- 
120) complex (1.49 ± 0.05 |iM) turned out to be slightly 
lower than that of the Prp43/Ntrl(51-1 10) complex 



Table 2. Unwinding activities for Prp43, Prp43/Ntrl complexes and 
Prp43-Ntrl(51-110) fusion proteins 



Protein v^i„ (nM/min) 



Prp43 


n.d. 


Prp43/Ntrl(l-120) 


30.0 ± 4.8 


Prp43/Ntrl(51-110) 


31.8 ± 4.2 


Prp43(58-767)/Ntrl(51-l 10) 


29.4 ± 2.4 


Prp43-Ntrl(51-110) 


127.2 ± 8.4 


Prp43 K418E-Ntrl(51-110) 


127.1 ± 10.3 


Prp43-Y595A-Ntrl(51-110) 


106.3 ± 7.3 


Prp43 Y610A S614A-Ntrl(51-1 10) 


100.2 ± 4.7 


Prp43 Y630A-Ntrl(51-110) 


125.9 ± 8.4 


Prp43-Ntrl(51-110) K67E 


123.4 ± 7.2 



(2.24 ±0.11 i^M). This indicates that residues within the 
first 50 residues of Ntrl are important for binding to 
Prp43, but not for the activation of the hehcase. 

Taken together, Ntrl(51-110) is sufficient for the full 
activation of Prp43. Furthermore, both Ntrl(l-120) and 
Ntrl(51-110) stimulate not only the helicase, but also the 
ATPase activity of Prp43. 

Protein-protein cross-linking indicates that Ntrl interacts 
with the C-terminal domains of Prp43 

In the absence of high-resolution 3D data derived from 
crystallization and/or nuclear magnetic resonance experi- 
ments, mass-spectrometric (MS) approaches can provide 
information about the spatial rearrangement of protein- 
protein and protein-hgand complexes. We set out to gain 
structural information on how Ntrl and Prp43 interact by 
using cross-linking and subsequent analysis of the cross- 
hnked protein regions by mass spectrometry. We studied 
the binary complex by chemical cross-linking with BS3 
and BS2G, which hnk primary amines, in this case lysine 
residues, with an 11.4 A and a 7.4 A spacer arm, respect- 
ively. The cross-linked complexes were separated by SDS- 
PAGE and cross-hnked peptides were identified by MS 
after in-gel digestion. 

In total, 42 intramolecular and 16 interprotein cross- 
links were identified (Figure 3B and Supplementary 
Table S3). The intramolecular cross-hnks located within 
the crystal structure of Prp43 were well within the 
maximum distance that the cross-linkers can span, thus 
providing a positive control for the method 
(Supplementary Figure S5). It is important to point out 
that cross-hnked residues do not directly contact each 
other, but provide information about proximal locations. 

Notably, aU of the interprotein cross-links between 
Prp43 and Ntrl(l-20) are located within the C-terminal 
OB-fold domain of Prp43 (635-767, Figure 3B), whereas 
the corresponding cross-linked sites within Ntrl(l-120) 
are more widely distributed throughout the protein's 
sequence and also include K67 within the G-patch motif 
(Figure 3B and Supplementary Tables S3 and S4). Many 
of the cross-hnks at the C-terminal region of Prp43 
involve lysine residues located close to the C-terminus of 
Prp43 (residues 746-767), a region that is not seen in the 
crystal structure and is thought to be very flexible (shaded 
in Figure 3C). 

Interestingly, K668, which lies in a loop connecting two 
beta-strands of the OB-fold domain of Prp43 and is not 



Table 3. ATPase activities of different Prp43-Ntrl fusion proteins 



Protein 


In absence of RNA 


In presence 


of A20-RNA 


Km (\iM) 


Vmi,x (nM/min) 


Km (hM) 


Vmi,x (|iM/min) 


Prp43-Ntrl(51-110) 


39.53 ± 3.92 


18.79 ± 0.66 


73.43 ± 12.17 


271.90 ± 19.45 


Prp43(l-745)-Ntrl(51-110) 


28.33 ± 4.61 


13.59 ± 0.87 


50.18 ± 9.50 


225.92 ± 16.29 


Prp43-Ntrl(51-102) 


17.50 ± 2.43 


5.50 ± 0.22 


38.74 ± 5.29 


121.33 ± 5.82 


Prp43-Ntr 1(5 1-99) 


10.68 ± 1.38 


5.60 ±0.18 


30.68 ± 6.16 


100.56 ± 6.61 


Prp43-Ntrl(59-110) 


8.77 ± 1.09 


4.02 ± 0.12 


43.06 ± 2.36 


98.10 ± 1.95 


Prp43-Ntrl(62-110) 


10.62 ± 2.18 


2.38 ± 0.12 


48.07 ±11.49 


29.35 ± 2.64 
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Figure 3. Protein-protein cross-linking map and localization on the C-terminal domain of Prp43. (A) MSI and MS2 spectra of cross-linked K60 in 
LTKTYGIGAK [Ntrl(l-120)] to K746 in IKEKVDR (Prp43). MSI spectrum shows the isotopic envelope of the quadruply charged cross-linked 
peptide pair. The MS2 spectrum, zoomed in the region of 0-900 /ij/r for clarity, depicts the identified peptide fragments. (B) Cross-linking map 
depicts both intramolecular and intermolecular cross-links identified in Prp43-Ntrl(l-120). Dark green areas indicate regions not solved in the crystal 
structure. Both BS3 and BS2G cross-links are represented, with a full line for a cross-link identified in both, and differently dashed lines for those 
cross-links identified only with one particular cross-linker. Both cross-linkers, despite not fully overlapping, aim at the same cross-linked regions: the 
very C-terminal region of Prp43 and the beginning of the G-patch motif of Ntrl. (C) Localization of Prp43 cross-linked residues, and rotated surface 
representation. Residues cross-linked to Ntrl are depicted in black, and the residues in the N-terminal domain determined to be protected in the 
footprinting studies are in blue. 



defined in the crystal structure, appears to be an important 
cross-linking hub. This particular position is cross-linked 
to residues Kll and K15 of Ntrl (Figure 3B), indicating 
that this residue is located close to the Ntrl interaction site 
and is flexible. The cross-linked residues within the 
minimal interaction domain of Ntrl (1-120) also exhibit 
a particular feature: K67 in the G-patch motif cross-links 
exclusively to residues K763 and K767 within the region 
closest to the C-terminus of Prp43. Importantly, K60 of 
Ntrl, which lies close to the G-patch motif, cross-Hnks to 
a wider area in the C-terminal part of Prp43 (733-767, 
Figure 3B), encompassing residues that are present and 
others that are not defined in the crystal structure of 
Prp43. Residues Kll and K15, located outside the 
G-patch region, cross-link to the 643-668 region and to 
others as far away as 733-746 (Figure 3B). These cross- 
links indicate that the residues 1-50 of Ntrl, although they 
are not involved in the activation, might also bind to 
Prp43, thereby leading to the observed higher binding 



affinity of Ntrl(l-120) to Prp43 compared with 
Ntrl(51-110). 

These cross-hnking results on Ntrl(l-120) in complex 
with Prp43 could be confirmed by cross-linking experi- 
ments on the minimal functional fragment of Ntrl 
(51-110). Not unexpectedly, a similar cross-linking 
pattern was obtained: K60 and K67 of Ntrl were cross- 
Hnked to the region of Prp43 closest to the C-terminus 
(Supplementary Figure S6 and Supplementary Table S4). 
Most of the observed cross-links originate from K60 of 
Ntrl, joining it to lysine residues between amino acids 
733-763 of Prp43. In addition, we identified a cross-hnk 
between K67 in Ntrl and K766 of Prp43, as well as an 
additional contact between K78 in the G-patch motif of 
Ntrl and K4 in the NTE of Prp43. It is of note that K4 is 
also part of a flexible part of the helicase: it is one of the 19 
amino acid residues encompassing the N-terminal tail, 
which has been shown to be in proximity to the 
C-terminal domain of Prp43 (PDB-ID 2xau). 
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A representation of the cross-linked residues on the 3D 
structure of Prp43 (Figure 3C, Supplementary Figure S8) 
reveals that they are located on a small surface region of 
the helicase on the C-terminal OB-fold domain. Most of 
the residues involved belong to the last 35 residues that are 
flexible and not required in vivo (58). As mentioned earlier, 
the identified residues indicate proximal contacts and not 
direct interactions, and they strongly indicate that the 
G-patch motif interacts with the three C-terminal 
domains of Prp43, namely, the Winged Helix (462-525), 
the Ratchet (521-634) and the OB-fold domain (635-767). 

Binding of Ntrl induces domain movements in Prp43 

Having identified the C-terminal domains of Prp43 as a 
major interaction site of Ntrl, we next asked whether the 
binding of Ntrl to Prp43 might induce changes in the 
conformation of Prp43 that could lead to the enhanced 
RNA unwinding activity (33). Monitoring solvent acces- 
sibiUty changes of residues of Prp43 that are induced by 
Ntrl binding can help identify changes in the overall con- 
formation. Hydroxy! radical probing of proteins yields 
information on relative solvent accessibility, as OH 
radicals can covalently modify side chains of amino acid 
residues (40). Therefore, we probed Prp43 alone and the 
complex Prp43/Ntrl(l-120) with hydroxyl radicals 
generated by the Fenton reaction in a time-dependent 
manner. Proteins were liydrolysed with the endopro- 
teinase trypsin, and the modified peptides and amino 
acids were identified by mass spectrometry. By quantita- 
tively comparing the rates of oxidation of a particular 
peptide in two states, i.e. bound and unbound, informa- 
tion about the relative solvent accessibiUty of the two 
states can be obtained. Particular residues at which the 
modification occurred can be identified and quantified 
by tandem mass spectrometry (Supplementary Figure 
SIA). 

In total, 44 peptides in Prp43 were identified and could 
be quantified, of which 8 peptides showed differences in 
their levels of oxidation when Ntrl(l-120) was bound to 
Prp43 (Supplementary Figure S7). These differences in 
modification levels concern different regions on Prp43, 
and several residues of the N-terminal extension (NTE) 
of Prp43 showed differences on binding of Ntrl(l-120). 
The residue El 6 was less strongly oxidized (modified to 
6.1 ± 1.9% in Prp43, and only to 2.7 ± 0.9% in the 
complex), which could have been due either to direct 
binding and thus protection by Ntrl(l-120), or to a 
movement 'into' the C-terminal domain and thus away 
from the surface of the protein. Additionally, residues 
E40, H44 and H57 of the NTE of Prp43 showed a 
higher oxidation level on the binding of Ntrl(l-120), 
which indicates that they changed their position during 
complex formation, probably in co-ordinated movement 
with the aNl helix (Figure 4B). 

To take a closer look at the function of the NTE of 
Prp43, we expressed and purified the variant Prp43(58- 
767), which lacks the 57 N-terminal amino acids of the 
hehcase. It turned out that this truncation shows no 
ATPase activity in the absence of Ntrl, but is still able 
to bind Ntrl(l-120). The highest observed unwinding rate 



of the Prp43(58-767)/Ntrl(5I-110) complex was 
29.4 ± 2.4nM/min, and hence in the same range as for 
the Prp43/NtrI(51-110) complex (Figure 2C and 
Table 2). This result suggests that the observed 
movement of the N-terminal domain of Prp43 is rather 
passive in nature and is due to the binding of Ntrl to 
the C-terminal domain. As the aNl -helix (19-33) interacts 
directly with the winged helix domain (457-521), the latter 
domain is also affected by the rearrangements that occur 
on binding of Ntrl. 

Additionally, differences in surface accessibifity could 
be detected in the C-terminal OB-fold domain, where 
W693 was not modified in Prp43 alone, but was 
modified to 90.0 ± 9.5% in the complex, and in the 
ratchet domain, where H587 was not modified in 
isolated Prp43, but was modified to 6.0 ± 1.9% in the 
complex. These changes in oxidation seem to be a result 
of Ntrl(l-120) binding to the C-terminal domain of 
Prp43, which induces movements in both domains 
(Figure 4C). 

Taken together, our probing experiments indicate that 
there are major rearrangements in the OB-fold domain of 
Prp43 on binding of Ntrl. This is also where we located 
the binding platform of the G-patch motif on Prp43 ac- 
cording to the protein-protein cross-finking results. 
Moreover, changes in solvent accessibility are observed 
for amino acid residues in the NTE of Prp43, and 
deletion of the NTE affects the intrinsic ATPase activity 
of Prp43, while additional experiments showed that the 
NTE of Prp43 is not required for the binding of or the 
activation by Ntrl. 

Frotein-RNA cross-linking reveals residue-specific RNA 
contacts in Prp43 and Ntrl 

Ntrl binding to Prp43 leads to increased unwinding 
activity, which is why we next set out to investigate the 
interaction of the helicase with RNA, with and without 
Ntrl, by UV-induced protein-RNA cross-finking. UV 
fight (254 nm) generates a covalent fink between a 
nucleic acid base and the side chain of an amino acid 
when they are in proximity. The newly formed covalent 
bond on the amino-acid residues, a zero-length cross-fink, 
makes it suitable for detection by mass spectrometry, and 
sequence information about the cross-finked protein 
region can be obtained (46). 

UV-induced cross-linking was initially analysed by 
SDS-PAGE by using a 21-mer ssRNA labelled with y- 
[^¥]-ATP. Both Prp43 and Ntrl (1-1 20) can cross-link 
to RNA, although Prp43 seems to do so more efficiently 
(Figure 5A). 

For the identification of the cross-finked protein regions 
by MS, we digested the cross-linked Prp43-RNA complex 
with RNAses and trypsin, and enriched for RNA cross- 
linked peptides by chromatography on Ti02 columns. The 
mixture was desalted and immediately analysed by MS. 
Eight different Prp43 peptides cross-linked to RNA were 
identified. For six of them, the cross-linked amino acid 
residue could be also identified (Table 4, Supplementary 
Figure Sll). These amino acids are within the RecA-I 
(Ml 54, Y174), RecA-2 (K418, Y440) and ratchet (Y595, 
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Table 4. Protein RNA cross-linlcs identified in free and bound Prp43 
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u/1 y /o) 


RecA-l 


M154 


'^'vaamsvaqr'^'' 
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+ 
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RecA-1 


Y174 


""lgeevgysir'" 


UG 


+ 


+ 
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U 


+ 


+ 


n.d. 


RecA-2 


K418 
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+ 




6.3 


RecA-2 
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UG 


+ 


+ 


3.7 


Ratchet 


Y595 
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UG 


+ 


+ 


132.2 


Ratchet 


¥630 


""ynlelnttdyespk'"'' 


GU 


+ 


+ 


7.3 


Ratchet 


n.i 


'^"^DHYLNYRSLSAADNIR*'^" 


GU 


+ 


+ 


n.d 


Ratchet 


Y610 


'""^DHYLNYR*"" 


U 


+ 


+ 


339.6 


Ratchet 


S614 


"'■-SLSAADNIR*'-'' 


UG 


+ 




169.7 



n.i.: not identified; n.d.: non-determined; (-)cross-link was not sequenced, identification based on MSI: b: bound; f: free. The domain and the cross- 
linked residue (when identified) are indicated, as well as the RNA oligonucleotide. 



Y630, Y610, S614) domains, and they are oriented 
towards the proposed nucleic-acid-binding cavity (36). 

To elucidate the functional relevance of residues 
identified by the UV cross-Hnking, several mutated 
variants of the Prp43-Ntrl(51-110) fusion proteins were 
prepared and analysed. The helicase activity was not 
affected in case of the K418E and Y630A mutations. In 
contrast to this, the unwinding activity was significantly 
decreased in case of the Y595A mutant and the Y610-S614 
double mutant, confirming the central role of the long 
ratchet helix in the unwinding process. 

When the peptide-RNA cross-hnking results for Prp43- 
RNA are compared with those for Prp43/Ntrl(l-120)- 
RNA, no quahtative differences were observed, meaning 
that the same cross-linking sites to RNA were identified in 
the presence or absence of Ntrl. Notably, a cross-hnk of 
the residue K67 of Ntrl (1-1 20) to RNA was identified, 
confirming that this protein is also in direct contact with 
RNA, in agreement with the SDS-PAGE results. In par- 
ticular, this shows that the G-patch motif (K67) interacts 
with RNA. On testing a Prp43-Ntrl(51-1 10) fusion 
protein containing the mutation K67E, it turned out 
that the helicase activity was not significantly affected by 
this mutation, indicating that this contact is not essential, 
at least for the unwinding of the used substrate. As the 
precise target sequence of Prp43 during the disassembly of 
the intron-lariat spficeosome has not been identified yet, it 
cannot be ruled out that Ntrl-K67 as well as K418 and 
Y630 might be essential in vivo. 

The two helicase-RNA complexes, with and without 
Ntrl(l-120), were then compared quantitatively. A re- 
markable difference was observed in the cross-linking 
yield of all identified peptide-RNA cross-hnks, which 
was found to be more efficient when Prp43 was cross- 
linked to RNA in the presence of Ntrl. In particular, 
peptide regions encompassing amino acids 591-601, 605- 
611 and 612-620 (SDEAYEYGIHK, DHYLNYR and 
SLSAADNIR, with Y595, Y610 and S614 cross-linked, 
respectively), which are located in the ratchet helix, were 
found to be cross-linked with a significantly higher effi- 
ciency as compared with Prp43 alone bound to RNA 
(as calculated from ratio of the intensities of the 



corresponding non-cross-linked peptide in Prp43 alone 
and bound to RNA, Table 4). We note that an increase 
in UV-induced cross-hnking yield cannot directly be 
correlated with RNA affinity at the particular protein 
region. Nevertheless, it does reflect a shift in equihbrium 
towards certain conformations that favour RNA cross- 
linking, in which the RNA is in close spatial proximity 
with the amino acid residues of this particular domain. 
Importantly, those regions where we observed a higher 
cross-hnking yield on Ntrl(l-120) binding coincide with 
those that showed increased solvent accessibility on 
Ntrl (1-120) binding to Prp43 (Table 4 and Figure 4B). 
Thus, we propose that the structural rearrangements 
that occur on Ntrl(l-120) binding allow an increased 
availabihty of the nucleic acid-binding cavity, an effect 
that is evidenced both by an increase in RNA cross- 
linking yield and by increased solvent accessibihty of 
amino acids in the ratchet domain. 



DISCUSSION 

We have reported here that the G-patch motif of Ntrl is 
intrinsically unstructured, but it can potentially adopt sec- 
ondary structure elements on binding to Prp43. 
Ntrl(51-110) is the minimal functional fragment 
required for the complete activation of Prp43, and it 
stimulates both, the hehcase and the ATPase activity of 
Prp43. Protein-protein cross-Hnking in combination with 
mass spectrometry has revealed that Ntrl interacts with 
the C-terminal domains of Prp43. Even more, a protein in 
which Ntrl(51-110) is directly fused to the C-terminus of 
Prp43 behaves as a functional complex of both, exhibiting 
an increased helicase activity. RNA binding contacts were 
investigated using UV-induced protein-RNA cross- 
Hnking, and single amino acid residues were identified 
that contact RNA directly. The results presented here 
confirm the crucial participation of the long ratchet helix 
in RNA binding, and present evidence that Ntrl is also 
involved in the interaction with the RNA. 

Several structure-function studies for helicases in 
complex with their activator protein(s) are available, 
providing different levels of interaction and activation. 
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For example, the activities of the SFl heUcase Upfl are 
stimulated by its interaction partner Upf2 (59). Upf2 is an 
intrinsically unstructured protein that forms a beta-hair- 
pin, and an alpha-helix on interaction with Upfl, de- 
creases the RNA affinity of Upfl and thereby stimulates 
its unwinding activity (60,61). 

The ATPase and helicase activities of the DEAD-box 
protein eIF4A are stimulated on binding of eIF4G (62). 
On binding of elF4G, the two RecA-like domains of 
elF4A become closer to each other, and therefore the 
affinities for RNA and ATP are increased (63). 

Another level of activation has been suggested for the 
DEAD-box protein Dbp5. The interaction partner Glel 
was shown to decrease the RNA affinity after strand sep- 
aration, and thus enhances the release of the substrate 
(64). The structure analysis of a Dbp5/Glel complex has 
revealed that the interaction is mediated by the small 
molecule inositol hexakisphosphate, which is required 
for stable complex formation (65). 

These examples show the existence of different binding 
modes of helicase activation by interaction partners based 
on either direct interactions with the RecA-like domain or 
on interactions with associated domains hke the CH- 
domain of Upfl . Furthermore, different levels of regula- 
tions have evolved, e.g. folding on interaction with a 
heUcase, stabilization of a productive conformation, im- 
plication of small molecules in the complex formation and 
the involvement of several interaction partners. 

Although the four spliceosomal DEAH-box helicases in 
yeast Prp2, Prpl6, Prp22 and Prp43 show a high sequence 
similarity in their helicase core as well as in their 
C-terminal domains, the N-terminal region that precedes 
the first RecA-hke domain is not conserved. 

Prp43 has the shortest NTE (89 residues) compared 
with Prp2 (220 residues), Prpl6 (347 residues) and Prp22 
(480 residues). The first 91 amino acids of Prp43 were 
shown to be dispensable in vivo (58), indicating that they 
are not involved in the activation of Prp43. We have 
shown here that the deletion of the N-terminal alpha- 
helix by which the NTE binds to the winged helix 
domain affects the intrinsic ATPase activity of Prp43. 
We furthermore could show that the first 58 residues of 
Prp43 are not required for the activity in vitro if Ntrl is 
present. 

The NTE of Prp2 and Prpl6 was shown to be function- 
ally dispensable in vivo (66,67). The role of the NTE in 
Prp22 is more ambiguous: the first 260 are not required, 
whereas the residues 262-350 are able to enhance the 
function of Prp22 in vivo (68). Moreover, the deletion of 
the complete NTE in Prp22 leads to higher ATPase and 
helicase activities in vitro, but to a defect in its 
spliceosomal function (68). This suggests that the NTE 
of Prp22 is required for the recruitment to the 
spliceosome. 

There is a fundamental difference between Prp22 and 
Prp43: while Prp43 is dependent on the presence of Ntrl 
for both, recruitment and activation (31), Prp22 shows 
intrinsic ATPase and helicase activity in vitro and presum- 
ably only requires interaction partners for the recruitment 
to the spliceosome (69). Comparisons of the C-terminal 
domains of hPrp22 and yPrp43 with homology models 



from hPrp2, hPrpl6 and hPrp43 reveal that their overall 
fold is similar, but that patches of different surface charge 
exist, suggesting that this contributes to the specific inter- 
action with other proteins (37). It can be speculated that 
differences in the C-terminal domain of Prp22 and Prp43 
lead to the different properties of both proteins. 

The ATPase activity rate of Prp43 is stimulated by 
RNA, both in presence and absence of Ntrl (70,33). We 
could show here that the extent of stimulation is in the 
same range in absence and in presence of the truncated 
Ntrl variants. Hence, RNA seems to influence the ATP 
affinity and/or the mechanism of ATP hydrolysis of 
Prp43. 

The crystal structure of the Prp43/ADP complex repre- 
sents a post-reaction state that cannot accommodate ATP 
in its active site (35,36). A comparison of the helicase 
core of Prp43 with the HCV NS3 helicase in 
its apo = nucleotide-free and in its ATP-bound state 
suggests that major conformational changes are required 
within the helicase core of Prp43 to aUow ATP binding 
(Supplementary Figure S9). The overlay of Prp43 with 
NS3/ATP/RNA shows that major changes occur within 
the RecA-2 domain, which lead to an enlargement of the 
nucleotide binding pocket and to a movement of the beta- 
hairpin that protrudes from this domain. These changes 
might be induced by a cooperative binding of the RNA 
substrate as weU as of Ntrl. 

Using UV-induced protein-RNA cross-hnking, we have 
identified eight residues within Prp43 that interact directly 
with the RNA. Of these residues, four are conserved in the 
three other spliceosomal DEAH-box proteins Prp2, Prpl6 
and Prp22, namely, Y174 of the RecA-1 domain, K418 of 
the RecA-2 domain and Y595 and Y610 within the ratchet 
domain. Y595 and Y610 were furthermore identified to be 
important for the helicase activity, as their mutation leads 
to a decreased unwinding activity, in agreement with the 
increased RNA cross-hnking yield determined for these 
residues. 

Although it was reported that the ATPase activity of a 
truncated Prp43 variant lacking the OB-fold domain was 
not stimulated in the presence of RNA (36), no cross-hnks 
were identified within the OB-fold domain of Prp43. This 
could be due to the fact that a single-stranded RNA sub- 
strate was used here, which only included five uridines. In 
general, most cross-links are found with uridines and 
rarely with a guanine or a cytidin base. Hence, the bases 
that potentially were interacting with the OB-fold domain 
may have been unsuitable for cross-linking, as the uridine 
nucleotides are located in the center of the used oligo- 
nucleotide. Furthermore, it is possible that the OB-fold 
domain only binds dsRNA, and therefore failed to 
interact with the ssRNA. 

In addition to the cross-links that were found within 
Prp43, a cross-fink between the RNA and the residues 
K67 of Ntrl was identified, providing the first experimen- 
tal evidence of a residue within the G-patch motif directly 
interacting with RNA. As we could not detect any RNA 
affinity of isolated Ntrl in an EMSA (data not shown), 
the interaction appears to be possible only when Ntrl is 
bound to Prp43. Although the mutation K67E did not 
influence the unwinding activity in vitro, it cannot be 
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ruled out that this residue is essential in vivo. Strikingly, 
K67 of Ntrl is conserved within in the G-patch motifs of 
the second activator of Prp43, Pfal and of the human 
Ntrl orthologue, TFIPll. Interactions of G-patch 
motifs with RNA have already been reported for the 
protein TgDRE from the parasite Toxoplasma gondii 
using fluorescence anisotropy measurements (71) and for 
the proteinase from the Mason-Pfizer monkey virus in 
electrophoretic mobihty shift assays (72), but no 
concrete residues responsible for interaction were 
reported. 

Ntrl stimulates the helicase activity of Prp43, and the 
N-terminal fragment Ntrl(l-120) is sufficient to mediate 
this interaction in vitro (33). These findings were confirmed 
in this study, and we could demonstrate that the residues 
51-110 of Ntrl are required and sufficient to activate 
Prp43. Furthermore, we could show that the two frag- 
ments Ntrl(51-1 10) and Ntrl(l-120) are able to stimulate 
the ATPase activity of Prp43. This result is in contrast to 
the finding of Tanaka and co-workers who detected no 
difference in the ATPase activity of Prp43 in absence 
and presence of full-length Ntrl. Although we used here 
only a fragment of Ntrl, it seems unlikely that additional 
parts of Ntrl can inhibit the stimulatory effect on the 
ATPase activity. Moreover, the G-patch protein Pfal 
was also shown to stimulate both the helicase and the 
ATPase activity of Prp43 during ribosome biogenesis 
(73); therefore, it was suggested that both activities are 
coupled. The action of Prp43 during the spliceosomal dis- 
assembly is most probably regulated in the same manner, 
meaning that Ntrl not only stimulates the hehcase but 
also the ATPase activity of Prp43. 

The fragment Ntrl(l-122) was shown in yeast two- 
hybrid studies to be able to interact with Prp43 (74). 
Additionally, the fragment Ntrl(l-120) was suggested to 
be sufficient for the stimulation of the helicase activity 
of Prp43 (33). In vivo studies have revealed that 
the N-terniinal 33 residues of Ntrl seem to be 
dispensable (75). 

On characterization of the helicase and the ATPase 
activities, it was shown here that both Ntrl(l-120) and 
Ntrl(51-110) are able to stimulate Prp43 to the same 
extent. Moreover, by fusing different fragments of Ntrl 
to Prp43, it turned out that for all fragments smaller than 
Ntrl(51-110), the ATPase activity drops dramatically. 
Hence, Ntrl(51-110) seems to be both sufficient and 
required for the activation of Prp43. A sequence alignment 
of the two different activator proteins of Prp43, Ntrl and 
Pfal, reveals that conserved residues are present in the 
range of residues 51-103 of Ntrl. This explains why 
Ntrl(59-110) was not able to completely stimulate 
Prp43. Admittedly, further studies have to be performed 
to investigate whether the residues 104-110 of Ntrl might 
be dispensable for the complete activation. Furthermore, 
the cross-hnking results as well as the determination of 
binding constant indicate that also some of the first 50 
residues of Ntrl are involved in interactions with Prp43, 
although these contacts are not responsible for the activa- 
tion itself. 

In vivo studies have revealed that the C-terminally 
truncated Prp43(l-732) is still functional, whereas 



Prp43(l-722) leads to a temperature-sensitive phenotype 
(58). Furthermore, a truncated Prp43 that lacked the 
complete OB-fold domain, Prp43(l-657), was shown to 
be unable to interact with the G-patch containing 
C-terminal domain of Pfal in vitro (36). SAXS studies of 
the Prp43/Pfal complex have proposed that the 
C-terminal domains of Prp43 are involved in the inter- 
action with Pfal (76). These findings are in agreement 
with the protein-protein cross-hnking results reported 
here, which indicate that the C-terminal domains of 
Prp43 are responsible for the binding of Ntrl. If the 
OB-fold domain is removed, apparently the overall struc- 
ture of this region is disturbed, precluding the binding of 
Ntrl. Strikingly, it has also been shown for the DEAH- 
box helicase Prp2 that the interaction with the G-patch 
protein Spp2 can be disturbed by mutations in the 
OB-fold domain (30). 

Although the used cross-linkers were of medium length 
and the observed cross-links can correspond to maximal 
distances of the C-alpha atoms of ~20 and 24 A, respect- 
ively, only cross-links in a small surface area of Prp43 
were identified, proposing that the binding platform of 
the G-patch motif is located in proximity to this limited 
region (Supplementary Figure S8). 

Although no 3D structure of a G-patch motif has been 
reported so far, it was suggested that the G-patch motif 
within the DNA repair protein TgDRE from T. gondii 
lacks any secondary structure elements (71). The CD 
spectra of Ntrl(51-110) presented here indicate the 
same, but also reveal that Ntrl(51-110) has the potential 
to evolve secondary structure elements on interaction with 
Prp43. Therefore, Ntrl could belong to the proteins of 
'dual personahty', which have been proposed to be able 
to perform transitions between an unstructured and a 
structured state (77), such as the protein Upf2, which 
forms an alpha-helix and a beta-sheet upon binding to Upf 1 . 

In summary, the following model for the activation of 
Prp43 by Ntrl can be proposed (Supplementary Figure 
SIO): the overhang of a dsRNA could bind within the 
proposed nucleic binding cavity and interact strongly 
with the long ratchet helix, as proposed for the Ski2-hke 
helicase Hel308 (34,36). This binding event could induce 
structural changes, especially within the RecA-2 domain, 
which lead to the remodelling of the nucleotide binding- 
pocket to facihtate ATP binding and to a movement of the 
beta-hairpin. Subsequently, the binding of Ntrl could 
adjust the conformation of the three C-terminal domains 
in such a way that it interacts with the dsRNA substrate as 
well as with the P-hairpin. The binding of Ntrl to the 
OB-fold domain could induce structural changes also in 
the helicase core domains including the P-harpin, which 
lead to a higher productivity of Prp43 and thereby to an 
elevated ATP hydrolysis rate even in absence of RNA. 

On hydrolysis of ATP, the conformation of the RecA-2 
domain changes and thereby the overhang of the dsRNA 
could be pulled further into the binding cavity, presum- 
ably supported by the long ratchet helix. The beta-hairpin 
could, eventually in a concerted action with the OB-fold 
domain, act as steric hindrance for the double-stranded 
part of RNA and therefore induce the separation of 
base pairs in this region. The subsequent release of ADP 
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and the binding of ATP could again lead to structural 
changes, provoking an at least partially processive un- 
winding of the dsRNA as proposed in the backbone 
stepping motor model for the HCV NS3 helicase (78). 
The fine details of this activation will be revealed 
when additional crystallographic and functional studies 
succeed and further describe the molecular details of the 
interaction. 
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